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Abstract. Upper bounds on the maximum number of minimal codewords in 
a binary code follow from the theory of matroids. Random coding provide 
lower bounds. In this paper we compare these bounds with analogous bounds 
for the cycle code of graphs. This problem (in the graphic case) was considered 
in 1981 by Entringer and Slater who asked if a connected graph with p vertices 
and q edges can have only slightly more that 2'^~P cycles. The bounds in this 
note answer this in the affirmative for all graphs except possibly some that 
have fewer than 2p + 31og2(3p) edges. We also conclude that an Eulerian 
(even) graph has at most 21 ~P cycles unless the graph is a subdivision of a 
4-regular graph that is the edge-disjoint union of two Hamiltonian cycles, in 
which case it may have as many as 2i~P + p cycles. 



1. Introduction 

The codewords of a binary code form a poset under support inclusion. The 
codewords that are minimal for that partial order are called minimal. They include 
the minimum weight codewords, but do not coincide, in general, with them. They 
occurred in decoding studies [1] , and independently in secret sharing schemes based 
on codes [3]. What is the number M(C) of minimal codewords a binary linear code 
C might have? If C has dimension fc, an immediate upper bound, which we call 
the trivial upper bound, is M{C) < 2^ — 1. This bound is met with equality for 
intersecting codes, i.e. codes any pair of codewords of which intersect nontrivially 
[5]. Conversely, any code meeting that bound with equality is intersecting. 

If G is a connected graph on p vertices with q edges, then its cycle code C{G) 
has length n — q, and dimension k ^ q — p + 1. The minimal codewords of C{G) 
are exactly the incidence vectors of cycles (that is, circuits in the cycle matroid) 
in G. Thus the above question restricted to the graphic case asks how many cycles 
a graph with p vertices and q edges can have. (We allow graphs to have multiple 
edges but no loops.) This question was raised in 1981 by Entringer and Slater 
[8j who observed that a graph with p vertices and q edges cannot have more than 
29-p+i cycles because of the trivial upper bound. They gave examples showing 
that it may have slightly more that 2''~'p cycles and asked if these examples were 
close to best possible. In this paper we verify this for all graphs except possibly 
some that are sparse. 

We view this in a coding theoretic context as follows: We denote by ^{R) the 
asymptotic exponent of the maximum of M{C) for families of codes C of rate R, 
where R is the limsup of k/n. Formally, let C[n, k] denote the set of all [n, k] codes. 
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and let 

M{n,k) = niax{Af(C) : C e C[n,k]}. 

We now introduce 

= limsup — log2 M{n, 

n— ^oo IT' 

If, in this definition, we replace C[n,k] by the cycle codes of graphs with p 
vertices and q edges (where n — q, and fc = g — p + 1) we obtain Mg(n, k), fj.g{R), 
respectively. 

By the trivial upper bound, n{R) < R- For R e [0,0.5] random coding shows 
that the bound is tight. For R > 0.5 the trivial upper bound can be improved using 
matroid theory for R> Rq with Rq f« 0.77. 

In this paper we prove an upper bound for the number of cycles in a graph with 
p vertices and q edges. That upper bound implies that 

for R < 0.5, and 

fig{R)^^{l~R)\og,{l-R) 

for 0.5 < i? < 1. 

It also implies that every graph with more than 2p + 0(log(p)) edges has fewer 
than 2''~P cycles. The graph 2Cp (the cycle of length p with every edge doubled) 
is 4-regular, and has therefore 2p edges, and has precisely 2'^~p + p = 2p + p cycles. 
This shows that we cannot omit the 0(log(p)) term above. But we may be able to 
omit it if we the raise the bound 2'^~p slightly. We show that for this, it suffices to 
investigate a very special class of graphs, namely those 4-regular graphs which are 
the union of two Hamiltonian cycles. 

2. Known bounds on M{n,k),M{C) and /i(i?) 
In this section we review some known bounds. 

If C is an [n, k] code, then we have M(C) <2^ -\. We call this bound the trivial 
upper bound. It is easy to see that a binary linear code C meets the the trivial 
bound with equality if and only if it is intersecting. For, if C is not intersecting, 
then two of its codewords, say c and d have disjoint supports. Their sum c + d 
is nonzero and non-minimal. Hence M(C) < 2*^ — 1. Conversely, if C has a non- 
minimal codeword, then it can be written as a sum of at least two disjoint support 
minimal codewords. Therefore C is not intersecting. 

The trivial upper bound implies that /J,(i?) < R. 

It is proved in [7] that, for a binary matroid on n points of rank n — k represented 
by an [n, k] code C say, we have 
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This implies that < H{R), where H is Shannon's binary entropy function 

defined for x G [0,1] by 

H{x) = -xhg^x- (1 - a;)log2(l - x). 

That upper bound is better than the trivial upper bound for i? > i?o ~ 0.77 where 



By averaging arguments (random coding) it was shown in |31 Cor. 2.5] that for 
R < 1/2, we have 

> R, 

and for i? > 1/2, we have 

KR) > H{R) -l + R. 



Finally in this section we state, as Theorem 1, a bound for M{n,k) from [U 
Theorem 5]. We refer to this bound as the Agrell upper bound. 

Theorem 1. For > ^ we have 

2k 



M{n,k) < 



□ 



3. Cycle codes of graphs. 

It is a long-standing and difficult problem in graph theory to find the maximum 
number of cycles a connected graph on p vertices and with q edges can have. This 
problem was raised by Entringer and Slater [8] who observed that no connected 
graph G can have more than 2''^^+^ cycles. This follows from the trivial upper 
bound because there is a binary [q,q~p+l] code C{G) called the cycle code of the 
graph. Its codewords are defined on the edge set and are the indicator vectors of the 
edge disjoint unions of cycles. The minimal codewords of C(G) are the indicator 
vectors of the cycles of G. Entringer and Slater [8] also observed that there are 
graphs having slightly more than 2'^~p cycles and asked if this is (essentially) the 
maximum. Finally, Entringer and Slater [5] observed that the maximum is attained 
for cubic graphs. 

The first bound significantly below the trivial bound was obtained by Aldred 
and Thomassen [2] who proved that no connected graph G can have more than 
1|2''~^'+^ cycles. This is the best known upper bound for cubic graphs. But, for 
graphs of average degree > 4 there are better bounds, and in fact, the question by 
Entringer and Slater has been answered for all graphs of average degree slightly 
greater than 4. The Agrell bound (Theorem 2) immediately implies the following. 

Corollary 1. If G is a connected graph on p vertices and with q edges satisfying 
q > 2p, then its number of cycles is at most 

{q-2p)^' 

□ 

We shall prove the following. 
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Theorem 2. If G is a connected graph on p vertices and with q edges, and we write 

q — 1 = {p — l)m + r 

where m, r are nonnegative integers, and < r < p — 1, 
then its number of cycles is at most 

gmf-l-''(m+l)^ 

□ 

If wc fix the ratio q/p and let p tend to infinity, then the bounds in Corollary 
1 and Theorem 2 are essentially exponential functions. The exponential function 
in Theorem 2 is in a sense best possible, as we point out below. Also, both results 
answer the question of Entringcr and Slater (asking if a graph can have significantly 
more than 2'^~p cycles) for graphs of average degree slightly more than 4. Corollary 

1 shows that counterexamples (if any) can have at most 2p+0(^) edges. Theorem 

2 goes further and says that they must have at most 2p + 0{\og{p)) edges. It would 
be interesting to answer the question for all graphs of average degree at least 4. 

Theorem 2 is a consequence of Theorem 3 below. 

A path is a graph with vertices vi. V2, .... Vp and edges viV2. ?'2?'.'b. ■ . • , '-"p-i^p- 
A multipath is obtained from a path by replacing some edges witli multiple edges. 
Let f{q,p) denote the maximal number of paths from Vi to Vp in a multipath with p 
vertices and q edges. This maximum is attained if no two edge multiplicities differ 
by more than 1. So, if p — 1 divides q, then 

f{q,p) = iq/ip-W-'- 

Lemma 1. If x,y are vertices in a graph G with p vertices and q edges, then G 
has at most f{q,p) paths from x to y. 

Proof: The proof is by induction on p. lip = 2 the statement is trivial. So assume 
p > 2. Let d denote the degree of x, and let xi,X2, . . . ,Xd be the neighbors of 
X. (Some of these neighbors may be identical.) By induction, G — x has at most 
f{q — d,p — 1) paths from a;, to y, for each i = 1,2, . . . ,d. So G has at most 
df{q — d,p—l) paths from x to y. As f{q — d,p—l) is the number of paths between 
the ends in a multipath with p — 1 vertices and q — cl edges, we may interprete 
df{q — d,p — 1) as the number of paths between the ends in a multipath with p 
vertices and q edges, where the first edge multiplicity in the multipath is d. By the 
maximum property of f{q,p), we have 

df{q-d,p- 1) < f{q,p), 
which completes the proof. □ 



Theorem 3. Letp,q be natural numbers > 2. 

There exists a graph with p vertices, q edges, and at least f{q— l,p) cycles. 
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// G is any graph with p vertices and q edges, then G has at most 

qf{q - 

cycles. 



Proof: 

By the definition of /, there exists a multipath with p vertices, q — I edges, and 
precisely f{q — paths between the ends. If we add an edge between the ends 
we get a graph with p vertices, q edges, and at least f(q — cycles. 

To prove the last statement, consider any edge e = xy in G. The number of 
cycles in G containing e is the number of paths in G — e from x to y. By Lemma 
1, this number is at most 

This completes the proof. □ 



Because of the logarithm in the definition of /i, Theorem 3 gives the right /i- value 
for the graphs of a fixed average degree. It is still interesting, though, to decide if 
the bound qf{q— can be lowered to about (q/p)^. Does the maximum number 
of cycles occur in graphs similar to tCpl Are there graphs of average degree 2t 
without multiple edges that have the same number or a larger number of cycles? 
These questions are open even for cubic graphs. As mentioned earlier, Entringer 
and Slater [8^ observed that a cubic graph on p vertices may have as many as 2^^^ 
cycles. It has been open for several years if this is close to the right number. For 
planar cubic graphs this was verified by Aldred and Thomassen [2]. For general 
cubic graphs they lowered the trivial upper bound 2^*/^+^ to (15/16) • 2^/^+^. 

We can now answer the question by Entringer and Slater |8] for all graphs with 
average degree slightly more than 4. 

Theorem 4. Let G be a graph with p vertices and q edges. If q > 2p + 3 log2 (3p) , 
then G has at most 2'^*' cycles. 

Proof: Consider first the case where q — 2p. The upper bound on the number 
of cycles provided by Theorem 2 is {2p) ■ 3 • 2^^^, which is more than 2'^^p = 2^. 
However, if we increase q to 2p + r, then the upper bound increases to (2p + r) • 3 • 
2P-2 . (3/2)''. This number is < 2?+*^ for r > 31og2(3p). 

This completes the proof. □ 

4. Applications to Hg{R). 
We can now determine HgiR) completely. 

Theorem 5. For < R < 0.5, 

^lg{R) = R. 

If R is of the form 1 — 1/t where t is a natural number > 2, then 
pig{R)^-{l-R)\og,{l~R). 
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The function iJ,g{R) is continuous and linear in each closed interval from 1 — 1/t 
to 1 — l/{t + 1), where t is a natural number > 2. 



Assume first that < R < 0.5. For any two natural numbers p, r we let Cp^r be 
obtained from 2Cp by subdividing one edge r times. Then Cp^r has p + r vertices 
and n = 2p + r edges. Thus the dimension of the cycle code is = p + 1, and the 
rate of the cycle code is k/n = {p + l)/(2p + r). For each natural number p we 
let r be the largest natural number such that k/n = {p + l)/{2p + r) > R. Then 
k = [i?n]. Also, r = [{p + 1)/R — 2p\. Recall that the number of cycles in 2Cp 
and hence also in Cp^r is > 2^*. Substituting these values in the definition of fXg and 
letting p tend to infinity, wc conclude that > R. 

The trivial upper bound shows that this inequality is, in fact, an equality. 

Consider next the case where R = 1 — 1/t for where t is natural number > 2. 

Let tCp be the cycle of length p where each edge has been duplicated t times. This 
graph has p vertices and q = pt edges. The number of cycles in this graph is 



the first term counting cycles of length p and the second cycles of length 2. The 
graph is regular of degree 2t. Hence it has n = pt edges and has rate {pt—p+l)/pt = 
1 — 1/t — 1/pt. Hence its cycle code is an [n,Rn + l]-code. If we delete an edge, 
then we get an [n — 1, \R{n — l)]]-code. Deleting an edge reduces the number of 
cycles only slightly. Letting p and hence also n tend to infinity, we conclude that 



Now assume that 1 — 1/t < R < 1 — l/(t + l), where t is a (fixed) natural 
number > 2. then we let G{p, R, r) denote the graph obtained from tCp by adding 
r edges between neighboring vertices such that all edge multiplicities are t or t + 1. 
The resulting graph G{p, R, r) has p vertices and n = pt + r edges. Thus the 
dimension of the cycle code is fc = p{t — 1) + r + 1, and the rate of the cycle code 
is k/n = {p{t -l)+r + l)/{pt + r). 

We first choose any p so large that the rate of tCp, which is (pt — p + l)/pt = 
1 — 1/t — 1/pt, is smaller than R. Then we let r be the largest natural munber such 
that the rate of G{p, R, r), which is (p(t — 1) + r + l)/(pt + r), is smaller than or 
equal to R. That is, r = —pt + [(p — 1)/(1 — R)\ . Then the cycle code of G(p, R, r) 
is an [n, [i?n] ]-code, where n = pt + r = [{p — 1)/(1 — -R)J is the number of edges 
of G(p,i?,r). 

The number of cycles in G(p, R, r) is > t^^'^it + l)*". 

If we substitute these values in the definition of /Xg and let p tend to infinity, 
then we conclude that 



Proof: 




^xg{R) > -(1 



R)\og^{l-R). 



lig{R) > ((1 - R){1 + 1) - 1) log(t) + (-t(l -R) + l) log(t + 1). 
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The right hand side is clearly a linear function. For R equal to 1 — l/< or 
1 — + 1) the right hand side has the same values as the lower bounds we 
obtained for those two values of R. So we have obtained a lower bound for ^J-g{R) 
which is continuous and piecewise linear. 

We claim that this lower bound is also an upper bound. We used the graph 
G{p,R,r) above. If we put q = n — pt + r, then G{p,R,r) has the maximum 
number of cycles among those graphs with q edges which are obtained from a cycle 
of length p by duplicating edges. The graphs used to give the lower bound f{q—l,p) 
in Theorem 3 are also graphs of this type. Hence G{p, R, r) has at least f{q — 
cycles. On the other hand, Theorem 3 says that any graph with p vertices and q 
edges has at most qf{q — l,p) cycles. So no graph with p vertices and q edges has 
more than q times as many cycles as G(p, R, r). Hence the lower bound for ^ig{R) 
obtained from the graphs G(p, R, r) is also an upper bound. □ 

The function fig{R) is less than the matroid upper bound for all R and also less 
than the random upper bound for R > 0.5. The Agrell upper bound gives the same 
upper bound on fi as the trivial upper bound. Figure 1 shows these bounds. 

5. Cycle codes of 4-regular graphs 

Cycle codes of cubic (that is, 3-regular) graphs have enjoyed particular attention 
because in order to answer the afore-mentioned question by Entringer and Slater 
[5], it suffices to consider cubic graphs. The investigations in this paper indicate 
that the 4-regular graphs also deserve attention. First of all, their cycle codes have 
rate R « 0.5, and this is the smallest value of R for which the function iig{R) 
changes shape. Here we shall provide another reason. Although it is merely an 
observation we call it a theorem because of the striking exceptions that appear in 
the statement. A graph is Eulerian if it is connected and all vertices have even 
degree. These graphs are particularly interesting in the present context because 
the vector 1 consisting of ones is a code word. Recall that a Hamiltonian cycle in 
a graph is a cycle containing all vertices. 

Theorem 6. // G is an Eulerian graph with p vertices and q edges, then G has at 
most 2''^^ cycles unless G is a subdivision of a A-regular graph which is a union of 
two Hamiltonian cycles. 

Proof: The map sending a codeword x into the codeword x -|- 1 is a map from the 
cycle code into itself. If the codeword of every cycle is mapped into a codeword 
which does not correspond to a cycle, then at most half of the 2'"''+^ codewords 
correspond to cycles, and the result follows. So we may assume that G has a cycle C 
such that both its codeword x and also the codeword x-|-l correspond to cycles, say 
C, C". Hence G cannot have a vertex of degree 6 or more. Each of C, C" contains 
precisely two edges incident with each vertex of degree 4. And each edge of G 
belongs to precisely one of C, C". So C, G' is a partition of G into two Hamiltonian 
cycles after the vertices of degree 2 have been replaced by edges. □ 
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Figure 1. ^(i?) versus R 

6. Conclusion and Open problems 

Figure 1 shows as a function of R, 

• the trivial upper bound i?, 

• the matroid upper bound H{R) = — i?log2(i?) — (1 — i?) log2(l — R) 

• the random lower bound H{R) — 1 + R 

• the function /ig(-R) for cycle codes of graphs. 

The function fJ.g{R) is known exactly. It is /ig(i?) — R ior R < 0.5, and lJ.g{R) = 
— (1 — R) log2(l — R) for R > 0.5. The function /i seems more problematic. Thus 
= R for R < 0.5, while for R > 0.5 we only have the bounds 

H{R) -1 + R< ^(i?) < min(i?, H{R)) 

the best known upper bound for R > 0.5. These bounds can be generalized to 
linear codes over a non-binary alphabet. 

By [5] long linear intersecting codes can exist only for R < 0.283. Thus the 
codes of rate between that value and 0.5 provided by random coding are "almost" 
intersecting. 

While we have found it is a wide open problem to find Even the 

following seems nontrivial. 

Open Problem 1. Is fJ-{R) a continuous function of Rl Is it concave? For which 
R is it maximum? 

Analogous questions have been considered by Manin 

We have answered Problem 1 for ^g{R). This function has maximum for R = 
2/3. The 6-regular graphs have rate ~ 2/3, so maybe also the 6-regular graphs are 
worth studying in more detail. 

As mentioned earlier, the 4-regular graph 2Cp has 2p +p cycles, and no 4-regular 
graph has more than 2^+^ cycles, by the trivial upper bound. 

Open Problem 2. Does there exist a A-regular graphs with p vertices and more 
than 2P + p cycles? 
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In case the answer is negative, then it seems that the 4-regular graphs are the 
only regular graphs for which there is a simple expression for the maximum number 
of cycles. 

Open Problem 3. Does there exist a real number c < 2 such that every A-regular 
graphs withp vertices and no multiple edges has less than 0^*+^ cycles? 

As mentioned above, tCp has p vertices, q = pt edges, and t^ +^(2) cycles. 

Open Problem 4. Does there exist a graph with p vertices, q = pt edges (where t 
is a natural number), and more than t^ +p{^ cycles? 

Acknov^rledgement: The authors are indebted to G. D. Cohen and H. Ran- 
driam for helpful discussions. 
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